Localizing Text and Symbols in Images from Biomedical Journal Articles

Author

  • J. Li
Abstract

Automatic localization and recognition of text and symbols in images from biomedical journal articles could significantly improve the indexing and retrieval of biomedical literature, and thus contribute to clinical decision support. The main difficulties in automatically localizing text and symbols in medical images are the irregularity of their occurrence and the variety of font features. These difficulties are compounded by image quality, interference from the image background, arbitrary placement, and variability in text block size. We present results of automatic localization and annotation of text and symbols in medical images. Our methods take advantage of gross image features and automatically identified image modality (classification of images into four broad types: color, illustration, radiographic, and other). Two-dimensional adaptive Wiener filtering is used as a preprocessing step to reduce image noise. Automatic histogram thresholding, morphological methods, the quadtree technique, the discrete cosine transform (DCT), and connected component analysis are applied selectively to different image types to extract text and symbol locations. Text area merging and region growing are used as post-processing steps to improve the precision of the bounding box locations. Initial experiments on 100 images achieve precision and recall of 78.42% and 89.38%, respectively, with an average accuracy of 72.02%.
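
Three of the steps named in the abstract (automatic histogram thresholding, connected component analysis, and text area merging) can be illustrated with a minimal pure-Python sketch. This is not the authors' implementation: the abstract does not say which thresholding rule is used, so Otsu's method is assumed here as one common choice, and the function names, the 8-connectivity, the `gap` parameter, and the toy image are all hypothetical. Wiener filtering, the quadtree technique, and the DCT are omitted.

```python
from collections import deque


def otsu_threshold(pixels, levels=256):
    """Return the gray level that maximizes between-class variance (Otsu).

    Assumption: Otsu's rule stands in for the unspecified
    "automatic histogram thresholding" of the abstract.
    """
    hist = [0] * levels
    for p in pixels:
        hist[p] += 1
    total = len(pixels)
    sum_all = sum(i * h for i, h in enumerate(hist))
    best_t, best_var = 0, -1.0
    w0 = 0      # number of pixels with value <= t
    sum0 = 0    # sum of gray levels of those pixels
    for t in range(levels):
        w0 += hist[t]
        if w0 == 0:
            continue
        w1 = total - w0
        if w1 == 0:
            break
        sum0 += t * hist[t]
        m0, m1 = sum0 / w0, (sum_all - sum0) / w1
        var = w0 * w1 * (m0 - m1) ** 2
        if var > best_var:
            best_var, best_t = var, t
    return best_t


def component_boxes(mask):
    """Find 8-connected foreground regions; return (top, left, bottom, right) boxes."""
    rows, cols = len(mask), len(mask[0])
    seen = [[False] * cols for _ in range(rows)]
    boxes = []
    for r in range(rows):
        for c in range(cols):
            if not mask[r][c] or seen[r][c]:
                continue
            queue = deque([(r, c)])
            seen[r][c] = True
            top, left, bottom, right = r, c, r, c
            while queue:  # breadth-first flood fill of one component
                y, x = queue.popleft()
                top, bottom = min(top, y), max(bottom, y)
                left, right = min(left, x), max(right, x)
                for dy in (-1, 0, 1):
                    for dx in (-1, 0, 1):
                        ny, nx = y + dy, x + dx
                        if (0 <= ny < rows and 0 <= nx < cols
                                and mask[ny][nx] and not seen[ny][nx]):
                            seen[ny][nx] = True
                            queue.append((ny, nx))
            boxes.append((top, left, bottom, right))
    return boxes


def merge_boxes(boxes, gap=2):
    """Merge boxes whose row and column extents lie within `gap` of each other."""
    boxes = list(boxes)
    changed = True
    while changed:
        changed = False
        for i in range(len(boxes)):
            for j in range(i + 1, len(boxes)):
                a, b = boxes[i], boxes[j]
                if (a[0] - gap <= b[2] and b[0] - gap <= a[2]
                        and a[1] - gap <= b[3] and b[1] - gap <= a[3]):
                    boxes[i] = (min(a[0], b[0]), min(a[1], b[1]),
                                max(a[2], b[2]), max(a[3], b[3]))
                    del boxes[j]
                    changed = True
                    break
            if changed:
                break
    return boxes


# Toy grayscale image (hypothetical data): bright "characters" on a dark background.
image = [
    [10,  10, 10,  10, 10, 10,  10, 10],
    [10, 200, 10, 210, 10, 10,  10, 10],
    [10, 200, 10, 210, 10, 10,  10, 10],
    [10,  10, 10,  10, 10, 10,  10, 10],
    [10,  10, 10,  10, 10, 10, 205, 10],
]
threshold = otsu_threshold([p for row in image for p in row])
mask = [[p > threshold for p in row] for row in image]
boxes = component_boxes(mask)
merged = merge_boxes(boxes, gap=2)
print(threshold, boxes, merged)
```

In this toy case the two vertical strokes are detected as separate components and then merged into a single text box, while the isolated bright pixel stays on its own. A real pipeline would also filter components by size and shape before merging.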

Similar resources

Natural scene text localization using edge color signature

Localizing text regions in images taken from natural scenes is one of the challenging problems due to variations in font, size, color and orientation of text. In this paper, we introduce a new concept so-called Edge Color Signature for localizing text regions in an image. This method is able to localize both Farsi and English texts. In the proposed method first a pyramid using diff...

Automatic identification of ROI in figure images toward improving hybrid (text and image) biomedical document retrieval

Biomedical images are often referenced for clinical decision support (CDS), educational purposes, and research. They appear in specialized databases or in biomedical publications and are not meaningfully retrievable using primarily text-based retrieval systems. The task of automatically finding the images in an article that are most useful for the purpose of determining relevance to a clinical s...

Biomedical article retrieval using multimodal features and image annotations in region-based CBIR

Biomedical images are invaluable in establishing diagnosis, acquiring technical skills, and implementing best practices in many areas of medicine. At present, images needed for instructional purposes or in support of clinical decisions appear in specialized databases and in biomedical articles, and are often not easily accessible to retrieval tools. Our goal is to automatically annotate images ...

Exploring use of images in clinical articles for decision support in evidence-based medicine

Essential information is often conveyed pictorially (images, illustrations, graphs, charts, etc.) in biomedical publications. A clinician’s decision to access the full text when searching for evidence in support of a clinical decision is frequently based solely on a short bibliographic reference. We seek to automatically augment these references with images from the article that may assist in fin...

Automatic Detection of Arrow Annotation Overlays in Biomedical Images

Images in biomedical articles are often referenced for clinical decision support, educational purposes, and medical research. Author-marked annotations such as text labels and symbols overlaid on these images are used to highlight regions of interest, which are then referenced in the caption text or figure citations in the articles. Detecting and recognizing such symbols is valuable for improvi...

Publication date: 2007